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Field of the Invention 

S 

The present invention relates to a method and system tor decompression of 
compressed data and other highly correlated data streams. 

Background of invention 

10 

image and data compression is of vital importance and has great significance in many 
practical applications. To choose between lossy compression and lossless 
compression depends primarily on the application. 

IS Some applications require a perfectly lossless compression scheme so as to achieve 
zero errors in the automated analysis. This is particularly "relevant when where an 
automatic analysis is performed on the image or data. Generally, Huffman coding and 
other source coding techniques are used to achieve lossless compression of image 

20 

In certain other applications, the human eye visually analyzes images. Since the 
human eye is insensitive to certain patterns in the images, such patterns are 
discarded from the original images so as to yield good compression of data. These 
schemes are termed as "visually lossless" compression schemes. This is not a 
25 perfectly reversible process as the de-compressed image data is different from the 
original image data. The degree of difference depends on the quality of compression, 
and the compression ratio. Compression schemes based on discrete cosine 
transforms (DCT) and Wavelet transforms followed by lossy quantization of data are 
typical examples of visually lossless scheme. 

30 

As a general rule, it is desirable to achieve the maximum compression ratio with zero, 
or minimal, possible loss in the quality of the image. At the same time, the complexity 
involved in the system and the power consumed by the image compression system 
are important parameters when it comes to a hardware-based implementation. 
35 , 

Usually, image compression is carried out in two steps. The first step is to use a pre- 
coding technique, which is normally based on signal transformations. The second 



step would be to further compress the data values by standard source coding 
■techniques such as, for example, Huffman and Lempel-Ziv schemes. 

Most efficient compression techniques require a transformation. This is also known as 
pre-coding. The initial pre-coding step is the most critical and important operation in 
image compression. The complexity involved with DCT and Wavelet based 
transformations is quite high because of the large number of multiplications involved. 
This is illustrated in the following DCT equation: 



In addition to the large number of multiplications involved in carrying out the above 
DCT equation, there is also a zigzag rearrangement of the image data, which involves 
additional complexity. DCT transformation uses a mathematical algorithm to generate 
frequency representations of a block of video pixels. DCT is an invertible, discrete 
orthogonal transformation between time and frequency domain. 

Transformation aids in increasing the efficiency of a second step, the entropy coder. 
At this stage, if the entropy coder produces good compression ratios, then the pre- 
ceding should transform the data into a form suitable for the entropy coder. If the 
transformation is not efficient, then the entropy coder is becomes redundant. Thus, 
pre-coding is the most important stage of any image compression algorithm. 

Another important property of any transformation is that it is reversible, to allow the 
reverse process to be applied at the decompression stage to obtain the original image. 
This transformation is extensively used in JPEG algorithms and their variants. 

However, DCT suffers from several problems. Firstly, the complexity of the equation in 
terms of the number of multiplications and additions. In the 2D case, with an array of 
dimension N x N, the number of multiplications is in the order of 2N* using a separable 
approach of computing 1D row and column DCTs. Specifically, for an 8 x 8 pixel array 
which is used in the JPEG family. 1024 multiplications and 896 additions are required. 
There have not been any signrficant improvements to reduce this computational 
overhead. 
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Even though the image data is an integer, their multiplication to cosine terms in the 
formula produces fractional numbers or real numbers because cosine values are 
fractional in nature until and unless they are integer multiples of Pi, which may not be 
5 the case. Since fractional numbers need infinite precision to store them exactly, they 
might produce errors in the reverse process, resulting in loss. 

Another popular transformation is the wavelet transform. This is used, for example, in 
the JPEG2000 image compression standard. A mother wavelet is used to decompose 
10 the image data into frequency sub-bands, which in turn increases the redundancy in 
most of the sub-bands, thereby improving compression ratios. Used in their original 
form, the mother wavelets do not give integer-to-integer transformation but when used 
after a process called lifting, they come integer-to-integer transforms. This makes the 
entire process lossless but does not achieve a high compression ratio. 

IS 

Color transformations also offer another compression technique. A commonly used 
color space is RGB. In RGB, every pixel is quantized by using a combination of Red, 
Green and Blue values. This format is popular among graphic designers, but is not 
ideal as a compression algorithm. 

20 

It is desirable to provide an image compression system which does not involve 
rigorous transforms, and complex calculations. It also has to be memory efficient and 
power efficient. 

25 There are various image compression techniques presently available. A familiar few 
are JPEG, JPEG-LS, JPEG-2000, CALIC, FRACTAL and RLE. 

JPEG compression is a trade-off between degree of compression, resultant image 
quality, and time required for compression/decompression. Blockiness results at high 

30 image compression ratios. It produces poor image quality when compressing text or 
images containing sharp edges or lines. Gibb's effect is the name given to this 
phenomenon - where disturbances/ripples may be seen at the margins of objects with 
sharp borders. It is not suitable for 2-bit black and white images. It is not resolution 
independent, and does not provide for scalability, where the image is displayed 

35 optimally depending on the resolution of the viewing device. 



JPEG-LS does not provide support for scalability, error resilience or any such 
functionality. Blockiness still exist at Higher compression ratios and it does not offer 
any particular support for error resilience, besides restart markers. 

JPEG-2000 does not provide any truly substantial improvement in compression 
efficiency and is significantly more complex than JPEG, with the exception of JPEG- 
LS for lossless compression. The complexity involved in JPEG-2000 is higher for a 
lower enhancement in the compression ration and efficiency. 

Although CALIC provides the best performance in lossless compression, it cannot be 
used for progressive image transmission as it implements a predictive-based 
algorithm that can work only in lossless/nearly-lossless mode. Complexity and 
computational cost are high. 

All data compression techniques are based on the fundamental principle of Shannon's 
Information Theory. This theory states that there is a limit to the number of bits 
required to code a unique symbol, also known as entropy. This is given as the 
following equation: 

: H:«>;Pi:lpga ; pr 

Where Pi is the probability of occurrence of the symbol. The implication of this 
equation is that if a symbol occurs frequently, then this symbol contributes to repetition 
and Is designated a lower priority when compared to a symbol whose frequency of 
occurrence is less. This forms the basis for all the entropy coding or source coding 
schemes. A shorter codeword is given to more probable events. For example, the 
more frequently the symbol occurs, the shorter its codeword is. Most encoders allot a 
minimum of at least one bit per symbol. 

Image data follows a Palladian distribution. This means that the occurrence of each 
symbol is equiprobable. Thus, all the symbols require almost the same number of bits 
which results in very low compression ratios. 



Summary of the Invention 



It is essential for a decompressed image to be identical to the original image. Once 
• information has been discarded during compression it cannot be retrieved. Thus, if 
during compression information was discarded, decompression only provides an 
5 approximation of the original image. 

The algorithm of the present invention does not employ approximations and is able to 
guarantee a perfect reconstruction of the lossless compressed image. The codification 
technique of the present invention is ideally suited for highly correlated data and digital 
10 images, as there is high accuracy requisition, and certain errors and distortions can 
affect the image quality. 

The present invention maintains full fidelity of the original digital data during the 
process of compression and decompression in a perfectly lossless manner. This 
15 renders a high compression ratio as there, is no prediction or quantization or 
approximation in the process. 

The present invention is a perfectly reversible technique of compression. The logical 
transform ensures a lossless and on the fly decompression. The original image data is 
20 retrieved perfectly lossless. As the reversion algorithm of the present invention is also 
an integer-to-integer transformation, loss of data is eliminated. The retrieved image is 
visually and mathematically identical to the original image data. 

Firstly, decompression involves a decoding process. A run length decoder follows the 
25 arithmetic decoding. The decoded data is then passed for reverse transformation and 
data rearrangement. The reverse transformation primarily has four variants: horizontal 
variant, vertical variant, predict variant and the multidimensional variant. 

The horizontal variant, vertical variant, and predict variant are classified as being in 
30 the one dimensional category. The multidimensional variant is classified as being in 
the two dimensional category. The data rearranging comprises a reversible sort 
process and a last to first rearrangement. 

The method may be used for an application selected from: medical image archiving, 
35 medical image transmission, database system, information technology, entertainment, 
communications applications, and wireless application, satellite imaging, remote 
sensing, and military applications. 
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Brief Description, of the Drawings 

In order that thB invention may be fully understood and readily put into practical effect, 
5 there shall now be described by way of non-limitative example oniy a preferred 
embodiment of the present invention, the description being with reference to the 
accompanying illustrative drawings in which: 

Figure 1 illustrates the entire image compression system based on repetition coded 
10 compression op a hardware implementation; 

Figure 2 is a sample grayscale image of a human brain, which is captured by 

magnetic resonance imaging ("MRI") to demonstrate the compression able to be 

achieved by repetition coded compression system; 

Figure 3 is an enlarged image of a small region from Figure 2; 
15 Figure 4 shows that the image of Figure 2 is made up of many pixels in grayscale; 

Figure 5 shows a 36-pixel region within the sample MRI image of Figure 2; 

Figure 6 shows the ASCII value equivalent of the image data values for the image of 

Figure 2; 

Figure 7 shows the application of repetition coded compression along the horizontal 
20 direction in the image matrix; 

Figure 8 shows the application of repetition coded compression along the vertical 
direction in the image matrix; 

Figure 9 shows the combination of horizontal and vertical bit-planes by a binary 
addition operation; 

25 Figure 10 shows the total memory required for the 36-pixel region before and after 
applying repetition coded compression; 

Figure 11 shows the application of repetition coded compression to the entire image; 
and 

Figure 12 shows the operational flow for the implementation of repetition coded 
30 compression. 

Detailed Description of Preferred Embodiments 

Image data is a highly correlated, this means that the adjacent data values in an 
35 image are repetitive in nature. Therefore, it is possible to achieve some compression 
out of this repetitive property of the image and then apply Huffman coding or other 
source coding schemes. Such a method would be very efficient. 
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In repetition coded compression ("RCC"), each element is compared with the previous 
element. If both of them are equal then a value of "1" is stored in a bit-plane. 
Otherwise a value of '0' is stored in the bit-piane. Only the difference value is stored 
in a matrix, instead of storing all the repeating values. 

In a one-dimensional performance of the method, only one bit-plane is used to code 
the repetition in the horizontal direction. 

But in a two-dimensional performance of the method, two bit-planes are used to code 
the repetitions in both the horizontal and the vertical directions. This is more efficient 
and gives a better compression ratio. 

The compression system is based on a mathematical comparison of adjacent image 
data values. The comparison is performed between adjacent image data values in 
both the horizontal as well as vertical directions. The bit-planes formed as a result of 
the comparison in the horizontal and vertical directions are respectively combined by a 
binary addition method. After this the resultant bit-plane positions are called as RCC 
bit-planes. The zero values in the RCC bit-plane are stored for lossless reconstruction 
of the original image. For lossless reconstruction, they are the only values stored. 
The stored values correspond to the same locations in the original image matrix as 
zeros in the RCC bit-plane and are hereinafter called RCC data values. AH the other 
image data values can be reconstructed by using the RCC data values, and the 
horizontal and vertical bit-planes. 

Figure 1 illustrates the entire image compression system based on repetition coded 
compression on a hardware implementation; The analog image signals 12 are 
captured by. the camera 10 and are converted into respective digital data 16 by a 
analog to digital converter 14. This digital date 16 is rearranged into a matrix of image 
data values by a reshaping block 18. The reshaped image matrix is stored in the 
embedded chip 20, which performs the entire repetition coded compression system. 
This therefore gives the compressed repetition coded compression data values 22 and 
also the bit-planes of data 24 for storage, archival and future retireval 26. 



Figure 2 is a sample image of the human brain which is captured by magnetic 
resonance imaging (MRI). This sample image may be used to demonstrate the 
compression achieved by repetition coded compression, it is a grayscale image. 
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Figure 3 zooms a small region from the sample MRI image of the human brain. This 
zoomed region mfy also be used for demonstrating the repetition coded compression 
system. 

Figure 4 shows that the image is made up of lot of pixels in grayscale. 

Figure 5 shows a 36-pixel region within the sample MRI image of the human brain. 

Figure 6 shows the ASCII value equivalents of the image data values which are 
originally used for data storage. Each value requires eight bte (1 byte) of data 
memory. Currently, the 36-pixel region requires about 288 bits or 36 bytes of data 
memory. That data could be compressed and stored with only 1 12 bits after repetition 
coded compression. 

Figure 7 shows the application of repetition coded compression along the horizontal 
direction in the image matrix. This results in the horizontal bit-plane and also the 
horizontal values stored. 

Figure 8 shows the application of repetition coded compression along the vertical 
direction in the image matrix. This result in the vertical bit-plane, and also the vertical 
values stored. 
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Figure 9 shows the combination of horizontal and vertical bit-planes by a binary 
addition operation. This results in only five zero values which correspond to the final 
values stored from the original image matrix 

Figure 10 shows the total memory required for the 36-pixel region before and after 
applying repetition coded compression. The original memory requirement was 288 
bits. After applying repetition coded compression the memory required was 112 bits. 
This is a great amount of compression. 

Figure 11 shows the application of repetition coded compression to the entire image. 
The size is compressed to 44,000 bits from the original 1 88,000 bits. 

Figure 12 shows an implementation of repetition coded compression. The image 
matrix 1201 is transposed 1202, encoded along the horizontal 1203 and vertical 1204 
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directions and the respective bit-planes 1205. 1206 are derived. Further compression 
. is achieved by combining the horizontal and vertical bit-planes 1203, 1204 by a binary 
addition operation. This results in the repetition coded compression bit-plane 1207, 
which is logically inverted 1208 and compared 1209 with the original Image matrix 
1201 to obtain the final repetition coded compression data values 1210. The 
repetition coded compression data values 1210, together with the horizontal and 
vertical 1 206bit-planes are stored in a data memory 1211 for archival and future 
retrieval. 

The coded data can be further compressed by Huffman coding. This compression of 
the image data is achieved using the repetition coded compression system. This 
system is fast as it does not make use of complex transform techniques. The method 
may be used for any type of image file. In the example given above, the system is 
applied only for grayscale images. It may be applied to color images. 

The system of repetition coded compression of images may be applied to fields such 
as. for example, medical image archiving and transmission, database systems, 
information technology, entertainment, communications and wireless applications, 
satellite imaging, remote sensing, military applications. 

The preferred embodiment of the present invention is based on a single mathematical 
operation and requires no multiplication for its implementation. This results in 
memory efficiency, power efficiency, and speed, in performing the compression. 
Because of the single mathematical operation involved, the system is reversible and 
lossless. This may be Important for applications which demand zero loss. The 
compression ratios may be significantly higher than existing lossless compression 
schemes. 

Applications 

Repetition Coded Compression (RCC) can be used in applications for medical 
imaging, digital entertainment and document management. Each of these verticals 
requires RCC to be implemented in a unique way to deliver a robust and powerful end 
product. 

RCC can be deployed in the following forms for commercialisation; 

1) ASIC or FPGA chips 

2) DSP or embedded systems 



3) Standalone, hardware boxes 

4) Ucensable software (as DLLs or OCX) 

5) Software deliverables 



Whilst there has been, described in the foregoing description a preferred embodiment 
of the present invention, it will be understood by those skilled in the technology that 
many variations or modifications in details of design, constructions or operation may 
be made without departing from the present invention. 

A method for decompressing compressed data, comprising: 

run-length decoding the compressed data; 

arithmetically decoding the compressed data; 

reverse transforming the decoded data; and 

rearranging, the transformed decoded data into a lossless decompressed 

form. 

the reverse transformation is one 
dimensional including a horizontal variant, a vertical variant, or a predict 
variant. 

... The reverse transformation is two 
dimensional such as a multidimensional variant. 

■"the rearrangement of the 
transformed decoded data comprises a reversible sort process and a last to 
first rearrangement. 

• ■ _ "the compressed data is image 

data. 

The image data originates from a 

photo, drawing or video frame. 

fhe method is used for an 
application selected from the group consisting of: medical image archiving, 
medical image transmission, database system, information technology, 
entertainment, communications applications, and wireless application, satellite 
imaging, remote sensing, 

A system for decompressing compressed data, comprising: 

a run-length decoder and an arithmetic decoder for decoding the compressed 

data; 

a reverse transforming module to reverse transform the decoded data; and 
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a data rearranging module to rearrange the transformed decoded data into a lossless 
decompressed form. 



reverse transformation is one 

dimensional including a horizontal variant, a vertical variant, or a predict 



reverse transformation is two 

dimensional such as a multidimensional variant. 

Hie rearrangement of the 
transformed decoded data comprises a reversible sort process and a last to 
first rearrangement. 

the compressed data is image data. 

~ - 1he image data originates from a 

photo, drawing or video frame. 
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Sample MRI of Human Brain (Pixel View) 
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